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ARTIFICIAL DNA BASE PAIR ANALOGUES 

The present application is a continuation-in-part of application 
serial number 099,744,. filed in the United States Patent and Trademark 
Office September 22, 1987. 
FIELD OF THE INVENTION 

This invention is directed to new DNA base pair analogues and 
methods of making and using these analogues, 
DESCRIPTION OF THE BACKGROUND ART 

The advent of simple and rapid synthetic procedures for the 
synthesis of oligodeoxynucleotides from protected deoxynucleotides has 
resulted in a substantial number of physical and biological investiga- 
tions of mismatch base pairs (Aboul-ela, et al . Nucleic Aci d Research, 
14: 4811, 1985) and investigations of base pairs where one base is an 
analog (Jiricny, et ah . Nucleic Acid Research , 14:6579, 1986). 

The question of whether it is possible to design a pair of bases 
that could function as an additional complementary base pair in the 
genetic apparatus of cells has not been explored. The criteria used to 
design complementary base pairs should address the issues of stability, 
biochemical pathways for the synthesis of (deoxy)nucleoside triphos- 
phates from bases and/or nucleosides, analog inhibition of essential 
metabolic pathways, DNA and RNA polymerase utilization of the 
(deoxy)nucleoside triphosphates, DNA polymerase error frequency and 
error correction, and the issue of mismatch base pair repair. 

Earlier very little structural or quantitative data was available 
about polymerase error frequency (Goodman, et aL , Journal of Molecu- 
lar Biology , 88 :423 » 1974 )» polymerase error correction, and mismatch 
base pair repair. These issues have been clarified significantly 
(Kramer, et aL , Cell , 38:879, 1984). An important caution for the 
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designer of complementary base pairs is that the relationship between 
the ultimate fidelity of reproduction of genetic information and the 
strength of interaction between different bases is not physically 
unique. 

SUMMARY OF THE INVENTION 

It is an object of the invention to provide new oligodeoxynucleo- 
tide base pairs comprising an artificial purine paired with an artificial 
pyrimidine wherein the artificial purine has 2,6, substituents that 
establish an interaction with 2,4 substituents of the paired artificial py- 
rimidine such that the structural integrity of the double strand is 
maintained. 

It is an additional object of the present invention to provide base 
pairs of artificial purines paired with artificial pyrimidines wherein the 
artificial purines have 2,6 substituents selected from H,0,S and NH2 
and the complementary base is an artificial pyrimidine having 2,4 sub- 
stituents selected from H f O,S and NH2» such that the 6 position of the 
artificial purine interacts with the 4 position of the artificial 
pyrimidine as a first base interaction and the 2 position of the artificial 
purine interacts with the 2 position of the artificial pyrimidine as a 
second base interaction, and wherein at least one of the first or second 
base interactions is H-S, and further such that when the base pairs are 
present in a double stranded genetic sequence containing A, T, C and G 
the structural integrity of the double strand is maintained. 

It is a further object of the invention to provide new compounds 
which can be used as artificial purines and artificial pyrimidines for 
integration into a double stranded genetic sequence as complementary 
base pairs. 
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In accordance with this invention, there are provided artificial 
pyrimidines of the formula: 

R2 




wherein Rl is hydrogen, sulfur, oxygen, or amino, R2 is hydrogen, oxy- 
gen, sulfur, or amino, and R3 is hydrogen, halogen, -SCH3, -OH, 
hydroxymethyl, alkoxyl, cyano, methylamino, nitro, or unsubstituted, or 
halogen substituted hydrocarbon groups 1-3 carbon atoms long. Also 
included are position 5 and/or 6 aza derivatives and position 1 deaza 
derivatives of these compounds. 

Further in accordance with the invention, there are provided 
artificial purines of the formula: 




wherein Rl is hydrogen, sulfur, oxygen or amino, and R2 is hydrogen, 
sulfur, oxygen or amino. Also in accordance with the invention are 
included 3-deaza and 7-deaza derivatives of these compounds. 

The present invention thus relates to double stranded genetic 
sequences having base pairs of adenine (A) and thymine (T), cytosine 
(C) and guanine (G), as well as base pairs of artificial purines paired 
with artificial pyrimidines wherein the artificial purines have 2,6 sub- 
stituents that establish an interaction, selected from hydrogen bonding 
and hydrophobic interactions, with 2,4 substituents of the paired 
artificial pyrimidines such that the structural integrity of the double 
strand is maintained. As occurs with natural base pairs, the artificial 
base pairs of the invention have a hydrogen bond between the l-purine 
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and 3-pyrimidine positions. In the base pairs of the invention, at least 
one substituted in the 2, 6 purine or 2 t 4 pyrimidine position is sulfur 
and the substituent complementary to the sulfur is hydrogen. 
Desirably, the groups which provide the interaction between comple- 
mentary artificial base pairs are sulfur and hydrogen; oxygen and hy- 
drogen; oxygen and amino; or hydrogen and hydrogen. 

The artificial purines and artificial pyrimidines and their use as 
paired bases represent a significant advance in genetics. Although 
some artificial purines and artificial pyrimidines were known, these 
prior art molecules were never employed for the formation of stable 
base pairs. In fact, prior art artificial molecules were employed to 
inhibit standard base, nucleoside, or nucleotide synthesis or to de-stabi- 
lize the target DNA as, for example, in their use as anti-cancer agents. 
In contrast, the paired artificial purines and artificial pyrimidines of 
the invention not only allow the maintenance of a stable DNA duplex, 
but also, these artificial base pairs will interact preferentially with 
each other even in the presence of naturally-occurring base pairs. 

A societal concern which has arisen with the advent of recom- 
binant DNA technology is the escape of genetically altered organisms 
and the possibly harmful affects they might have on the environment. 
If a recombinant organism incorporating the artificial base pairs of the 
invention were to escape, or be released into the environment, it would 
be unable to replicate due to the absence of the necessary artificial 
purine and artificial pyrimidine bases or nucleoside comprising the arti- 
ficial base pair. 

Using the artificial base pairs of the invention it is possible to 
design organisms that, even if they were to be released or escape into 
the environment, would be unable to replicate. Since the artificial 
base pairs of the invention cannot be synthesized by any natural 
organism -or by the recombinantly modified host organism, the incorpo- 
ration of the artificial base pairs of the invention into the genome of 
the recombinant organism will prevent replication unless the artificial 
bases are supplied exogenously as, for example, in the growth medium. 
This is because the host organism does not have the necessary 
biosynthetic machinery to allow it to synthesize the artificial base 
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pairs of the invention from other substrates, making the organism 
totally dependent upon an external source of the artificial base pairs. 
Somewhat similarly, the rate of replication of an organism can be con- 
trolled by controlling the concentration of the artificial bases in the 
growth medium or available to the organism. 

A further advantage of the artificial base pairs of the invention 
is that they can be used to produce recombinant organisms in which 
replication of the organism is synchronized from specific positions in 
the chromosome of the organism. 
DESCRIPTION OF THE DRAWINGS 

Figure 1: 

(A) A schematic representation of the base pair 5-methyl-2~ 
pyrimidinone, left base, and 6-thioguanine, right base. (B) the photo- 
graph shows the base pair 5-methyl-2-pyrimidinone/6-thioguanine 
derived from a cytosine/quanine base pair. The cytosine/guanine base 
pair is three base pairs from the end of an oligodeoxynucleotide duplex 
determined by X-ray crystallography (Dickerson, et ah . Journal of 
Molecular Biology . 149 : 761, 1981). The amino group of cytosine, the 
left base, was replaced with a hydrogen, bond length 1.09 A. The 
5-methyl group is not shown. The oxygen of guanine, the right base, 
was replaced with a sulfur, bond length 1.7 A. No other changes were 
made. The dots indicate the extent of the van der Waals radii for sul- 
fur, 1.8 A, and hydrogen, 1.2 A. 
DETAILED DESCRIPTION 

The base pairs of the invention desirably should satisfy certain 
conditions: 1. The base pair should contribute to the stability of the 
duplex molecule. 2. There should be a significant free energy discrimi- 
nation against base pairing between the new and standard bases when 
compared to either the standard base pairs or the new base pair. 

The primary chemical and biological rationalizations underlying 
the choice of the base pairs are: L The hydrogen bond between the 3- 
nitrogen of the pyrimidine and the 1-nitrogen of the purine is retained 
so that the net change in hydrogen bonds of these positions and water 
molecules will not change with duplex formation. 2. Spectroscopy in 
the gas phase suggested that the hydrogen-bond force constant of sulfur 
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is one half that of oxygen and that the most stable angle between the 
X-H axis and the symmetry axis of the sulfur bonds is 90 degrees 
instead of the 45 degrees of oxygen. 

Other substituents can be substituted at various positions as long 
as the substituent does not disrupt or de-stabilize the basic function of 
the double stranded genetic sequence molecule. 

The selection of acceptable artificial purines and complemen- 
tary artificial pyrimidines as base pairs is influenced by such bonding 
factors as, for example, hydrogen bonding and hydrophobic interactions 
as well as steric factors. For example, the artificial pyrimidines of the 
invention do not utilize the iodine atom at the 4 position because of the 
steric stability problems often created by this large atom. 

The ratio of the association constant of the duplex with the base 
pair G/T compared to A/T, Table IV, is 1/3, a value that is a factor of 9 
greater than the value determined in 1M NaCl by Aboul-ela, et al . 
( Nucleic Acid Research . 13:4811,1985). Their value is based on the 
interpretation of optical density changes during melting within the 
. framework of a two state model. Markey, et al . ( Biopolvmers . 22:1247, 
1983) have shown that the caloric enthalpy is different than the Van't 
Hoff enthalpy as the ionic strength increases. The result indicates the 
two state model may introduce a significant error in the calculation of 
association constants. A direct NMR measurement at low ionic 
strength and 15 °C of the association constants between 7-mers with a 
G/T substituted for a G/C gave a value of 1/25 (Salisbury, et al .. Journal 
of the Che mical Society. Chemical Communications , 14,985,1985). The 
equivalent ratio from Table IV is 1/42 at 19 °C. The very strong depen- 
dence of stability on the detailed sequence when G/T base pairs are 
present is illustrated by crystals of a duplex composed of oligodeoxy- 
nucleotide G-G-G-G-T-C-C-C being stable at room temperature 
(Kneale, et al .. Journal of Molecular Biology . 186:805,1985) while crys- 
tals of G-G-G-G-C-T-C-C melted above 6°C (Hunter, etal., Journal of 
Molecular Biology. 190:605,1986). On general grounds it would be 
expected that values based on a two state model of melting would over- 
estimate differences in stability while an estimate based on enzymatic 
ligation of duplexes might underestimate the difference because 
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duplexes with only partial base pairing may function as a substrate. 
The similarity between the result of the enzymatic ligation and the 
result of the NMR method is reassuring. 

Using a template oligodeoxynucleotide containing the base 2-py- 
rimidinone and the Klenow fragment of DNA polymerase I, Charczuk, 

et ah ( Nucleic Acid Research . 14:9530,1986) have presented 

preliminary evidence that no standard nucleotide is incorporated across 
from 2-pyrimidinone. 

With the exception of the G/T base pair the results of Table IV 
are consistent with the results of Aboul-ela, et ah (ibid). Both show the 
standard base pairs confer much greater stability than mismatched 
pairs of the standard bases. The base pair 6-thioguanine/5-me- 
thyl-2-pyrimidinone conferred a stability that was nearly equal to the 
standard base pair adenine/thymine. The mismatch base pair that had 
the greatest stability, guanine/5-methyl-2-pyrimidinone, apparently 
does not have the appropriate geometry for significant polymerization 
to occur with DNA polymerase I (Charlczuk, et ah . ibid}. The results 
suggest that the base pair 6-thioguanine/5-methyl-2-pyrimidinone has 
the necessary physical characteristics to be a useful complementary 
base pair. 

Experiments with Escherichia coli C600 using either tritium 
labeled 6-thioguanine or deoxyribosyl-5-methyl-2-pyrimidinone showed 
that the (deoxy)nucieoside triphosphates were synthesized. Extremely 
small amounts of the separate bases were incorporated into the DNA. 

In addition to the matters discussed above, the (deoxy)nucleoside 
triphosphates should be substrates for both RNA and DNA polymerases 
with templates containing the complementary base and the analogs or 
their derivatives should not be significant inhibitors of essential meta- 
bolic pathways. Mismatches between the new and the standard bases 
should be enzymatically correctable. 

The artificial pyrimidines can also be either 5 or 6 aza deriva- 
tives. If position 5 is carbon it can have substitution of such radicals as 
hydrogen, halogen, -SCH3, -OH, alkoxyl, cyano, methylamino, 
hydroxymethyl, nitro, and unsubstituted or halogen substituted 
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hydrocarbon groups 1-3 carbon atoms long. The artificial purines can 
be also the 3-deaza or 7-deaza derivatives. 

At least one of the 2, 6 purine or 2, 4 pyrimidine substituents is 
sulfur. Whenever a thioketo group is present at a position on one mem- 
ber of the artificial base pair, a hydrogen is present at the complemen- 
tary position of the other member of the artificial base pair. 
Desirably, in addition to the sulfur-hydrogen complementary substi- 
tuents, the artificial base pairs will have oxygen-hydrogen, or oxygen- 
amino complementary substituents. The 2, 6 purine substituents can be 
the same or different, for example, sulfur and oxygen, sulfur and 
amino, or sulfur and sulfur. Similarly, the 2, 4 pyrimidine substituents 
can be the same or different. 

Suitable artificial base pairs are: 
4-thioketo-pyrimidine and 2-thioketo-purine 
2-amino-4 thioketo-pyrimidine and 2-keto-purine 
2-thioketo-pyrimidine and 6-thioketo-purine 
2-thioketo-4-amino-pyrimidine and 6-keto-purine 
2-keto-4- thioketo-pyrimidine and* 2-amino-purine " 
2-thioketo-4-thioketo-pyrimidine and purine 
2-keto-pyrimidine and 2-amino-6-thioketo-purine 
4-thioketo-pyrimidine and 2-keto-purine 
4-keto-pyrimidine and 2-thioketo-purine 
2-keto-pyrimidine and 6-thioketo-purine 

Other suitable artificial base pairs wherein the pyrimidines can 
also be the l-deaza derivatives (pyridine derivatives), but the purines 
can not be 3-deaza derivatives are: 
2-amino-pyrimidine and 2-keto-6-thioketo-purine 
pyrimidine and 2-thioketo-6-thioketo-purine 
4-amino-pyrimidine and 2-thioketo-6-keto-purine 

The artificial purines and pyrimidines used in the base pairs can 
be synthesized using techniques known to those of ordinary skill in the 
art (see Synthetic Procedures in Nucleic Acid Chemistry . Townsend, et 
al., Eds., Part 1 (1978), 2 (1978) and 3 (1986); Zorbach, etaL, in Syn- 
thetic Procedures in Nucleic Acid Chemistry . Vol. i, (1965). The 
appropriate deoxynucleoside of a given base can be synthesized from 
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2~deoxy-3, 5-di-O-p-toluoyl-D-erythro-pentosyl chloride (Bhat, in Syn- 
thetic Procedures in Nucleic Acid Chemistry . Zorbach, et aL, Eds., Vol. 
1, p. 521, 1968) and the appropriately protected base by either: 1. the 
silyl-mercuric method in solution (Birkofer, et aL, Angew. Chem. , 
77:414, 1965) or by fusion (Kotick, et aL, Journal of Organic Chemistry . 
34:3806, 1969); or 2. the stereospecfic sodium salt method 
(Kazimierczuk, et ah, Journal of The American Chemical Society . 
106:6379, 1984). 

Synthesis of the 3- and 7-deaza purine deoxynucleosides can be 
done using the sodium salt method (Kazimierczuk, et aL, ibid) and the 
appropriate derivatives (Gingis, et ah, Nucleic Acid Research . 15:1217, 
1987). The synthesis of pyrimidine c~deoxynucleosides can be done 
using methods analogous to those described by Sato, et aL (in Nucleic 
Acid Chemistry . Townsend, et ah, Eds., Part 3, p. 81, 1978) using 
nucleosides. 

The synthesis of 1-deaza pyrimidine derivatives (pyridine deriva- 
tives) can be done using standard organic chemistry. Pyridine deriva- 
tives will react with 2-deoxy-3,5-di-0-p-toluoyl-D-erythro-pentosyl 
chloride in the presence of a Lewis acid, AICI3 or BF3, or silver 
perchlorate to give an electrophilic substitution at the 3 position of the 
pyridine derivative. 

In those instances where both the alpha and beta anomers of the 
artificial deoxynucleosides are produced, they can be separated by such 
standard techniques as, for example, differential crystallization (in 
Nucleic Acid Chemistry . Townsend, et aL, Eds., Part 2, 1978) or column 
chromatography (Lu, et aL, Oreanic Chemistry . 37:2923, 1972), 

The transformation of host organisms by incorporation of the 
artificial base pairs of the invention can be accomplished by the uptake 
of DNA in the form of linear segments or as part of a plasmid or phase 
capable of integrating its nucleic acid into the host genome. Tech- 
niques for host cell transformation are well known to those of skill in 
the art and will not be further described. 

It is preferred that the DNA contain a multiplicity of artificial 
complementary base pairs. Further, it is preferable that the DNA 
sequence containing the artificial base pairs integrate into a region of 
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the host cell genome in a location such as, for example an intron, 
where significant disruption of host genetic expression or viability will 
not be significant. 

Eucaryotic and procaryotic organisms, both aerobic and anaero- 
bic, can be used as hosts for transformation with the artificial base 
pairs of the invention. The transformed organisms can be cultured in 
aqueous media in a suitable fermentation vessel. Typically, the aqueous 
media will be, for example, maintained at about 37 °C, and near neutral 
pH and contain appropriate nutrients such as carbohydrate or glycerol 
as a carbon source, nitrogen sources such as ammonium sulfate, potas- 
sium sources such as potassium phosphate, trace elements, magnesium 
sulfate and the like. Once again, culture media and conditions will vary 
with the host organism, but are well known in the art. 

After a host organism has been constructed which contains the 
synthetic base pairs of the invention, it is then possible to regulate 
replication of the host cell by controlling the concentration of artifi- 
cial base pairs in the culture media. The appropriate concentration of 
base pairs may vary with the particular organism and with the number 
of pairs of bases in the host genome. The optimal concentration of 
synthetic base pairs in the culture media would be readily determinable 
by one of skill in the art. 

The artificial base pairs of the invention also allow recombinant 
organisms to be produced wherein replication of the organism is syn- 
chronized. This technique can be achieved using standard methods of 
restriction, enzymes and ligation to introduce a sequence of artificial 
base pairs such as, for example, the Mu phage mutant Mud I (lac, ap) 
(Casadaban, et al M Proceedings of the National Academy of Sciences . 
76:4530, 1979). The Mud genome integrates almost anywhere in the E. 
coU chromosome and clones with Mud I, integrated in a particular 
region can be selected (Casadeban, et aL, ibid), in the absence of an 
external source of artificial bases, replication will stop when the repli- 
cation complex reaches the artificial base pair in the chromosome. 
Eventually, all of the cells in the culture will arrive and stop replica- 
tion at this unique site in the chromosome. The addition of the artifi- 
cial bases will start replication of all of the cells at the same time and 
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at the same place. In addition, this method is appropriate for any 
procaryote that has an appropriate vector which is integrated into the 
host chromosome. Likewise, since there are many unique sites of initi- 
ation of replication in the eucaryotic chromosome, the use of an 
integratable vector carrying the artificial base pairs of the invention 
will allow control of the synthesis of restricted intervals of the 
eucaryotic chromosome in the same manner as described above for 
procaryotes. 

The above disclosure generally describes the present invention, 
A more complete understanding can be obtained by reference to the 
following specific examples which are provided herein for purposes of 
illustration only and are not intended to limit the scope of the 
invention. 

EXAMPLE 1 
SYNTHESIS OF 

6-THIOCUANINE/5-METHYL-PYRIMIDIN-2-ONE BASE PAIR 
A, Materials 

The chemicals employed and their sources were: beta- 
deoxynucleosides (Sigma), benzenethiol (Eastman), cetyltrime- 
thylammonium bromide (Aldrich), 2-chiorophenyl-dichlorophosphate 
(Aldrich), 4,4'-dimethoxytrityl chloride (Aldrich), long chain alkylamine 
controlled pore glass (Pierce), l-(mesity-lene-2-sulfonyl)-3-ni- 
tro-l,2,4-triazole (Aldrich), mercury (n) cyanide (Aldrich), l-me- 
thylimidazole (Aldrich), 2-nitrobenzaldoxime (Aldrich), p-nitrophenyl 
acetate (Sigma), silver carbonate (Aldrich), silica Woelm TSC (INC 
Nutritional Biochemical). All solvents were redistilled and stored under 
appropriate anhydrous conditions (Sproat, et ah . in Gait, M.J. (Ed.), 
Oligonucleotide Synthesis . IRL Press, Oxford, 1984). 

Enzymes employed and their sources were: polynucleotide 
kinase (Bethesda Reserch Labs), snake venom phosphodiesterase 
(Sigma), T4 DNA ligase (Bethesda Research Labs), Sal I (New England 
Biolabs.). 
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B. Synthesis of deoxvnucleosides 

1. 2-amino-9-(2-deoyV"beta"D 1 ribofuranosvl)"9H-purine'6-thiol 
(beta-deoxv-6-thioguanosine) . 

The beta anomer of 2-acetoamido-6-chloro-9H-(2-deoxy- 
3,5-di-o-p-toluoyl-D-ribofuranosyl) purine was prepared by a known 
procedure (Roark, et aL . Townsend, L.B. and Tipson R.S. (Eds.), Nucleic 
Acid Chemistry . John Wiley and Sons, Part 2, 583, 1978). The protected 
beta anomer of the 6-chloro derivative was deprotected and thiated 
according to Acton, et al. (in Synthetic Procedures in Nucleic Acid 
Chemistry, Zorback, et aL , Eds., Vol. 1, 272, 1968). The resulting mate- 
rial, after crystallization from hot water, had the expected ultraviolet- 
visible spectrum (Tong, et al. . Journal of Organic Chemistry . 32:859, 
1967). 

2. l-(2-deoxv-beta-D-ribofuranosyl)-5-methvl-2-pyrimidinone 
(beta-deoxvribosyl-5-methyl-2-pyrimidinone) . 

4-thiothymidine was prepared by a published procedure 
(Wempen r et al .. in Grossmen, L., and Moldare, K. (Eds.), Methods in 
Enzymology, Academic Press, XII, Part A, 75, 1967). Deoxyribosyl- 
5-methyl-2-pyrimidinone was prepared from the 4-thiothymidine by 
reduction with Raney nickel. 5.9 gm (22 mmole) of 4-thiothymidine 
was added to 180 ml of distilled water and 60 ml of ethyl alcohol in a 
flask with a reflux condenser. 24 gm of Raney nickel was added and 
the solution heated to reflux. For the highest yields a 4-thiothymidine 
solution was titrated with each Raney nickel preparation and the opti- 
cal density of a sample in 0.1 N HC1 was determined at 260, 322, and 
334 nanometers. The optical density decreased at all three wave 
lengths. The reaction was terminated when the optical density at 322 
nanometers was equal to or greater than the optical density at 334 
nanometers. The decrease in optical density at 334 nanometers from 
the initial solution was about a (actor of 6 for the maximum yield. The 
reaction was followed with silica gel thin layer chromatography (TLC) 
using isopropanol. A blue fluorescent spot, characteristic of the com- 
pound, appeared with an Rf of 0.35. The initial greenish solution 
became a* very light yellow by the end of the reaction. Because other 
products where formed the reaction was not run until all of the 
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4-thiothymidine was used. The suspension was filtered hot and the 
Raney nickel boiled with 150 ml of water. The solution was filtered 
hot. The combined filtrates were evaporated. Deoxyribosyl-5-me- 
thyl-2-pyrimidinone was purified by dry column chromatography with 
Woelm silica m. 10 ml of methanol was added to the solid from the 
Raney nickel reduction. 3.7 ml of the clear solution was added to 3.7 
gin of Woelm silica m and air dried. The dried silica with the 
deoxyribosyl-5-methyl-2-pyrimidinone was added to the top of a column 
of dry Woelm silica, 20 inches long by 1 inch diameter, contained in 
nylon tubing. Isopropanoi was the solvent. The solvent front was 
allowed to reach the bottom of the silica column before terminating 
the chromatograph. The nylon tubing was cut into 1 inch sections and 
the silica in each section was extracted with 10 ml of methanol. Sam- 
ples from each section were run on silica gel TLC with isopropanoi. 
The methanol extracts were pooled from those sections that showed 
only the fluorescent spot with Rf 0.35. The pooled silica from the 
appropriate sections was extracted again with methanol and pooled 
with the first extraction. After filtration the methanol was evapo- 
rated to a small volume and the remaining fine particles of silica were 
removed by centrifugation. The remaining methanol was evaporated 
and the residue dissolved in 10 ml of hot ethyl alcohol. The solution 
was placed at -20 C. Crystals formed overnight. The yellow superna- 
tant was decanted and the* crystals washed in cold ethanol. The volume 
was reduced to 4 ml and placed at -20 C. More crystals formed. Silica 
gel TLC revealed a contamination of less than 5%. The ultraviolet- 
visible spectrum of a solution of the crystals was equivalent to the lit- 
erature spectrum (Laland, et ah . Biochemical Journal , 90:76, 1964). 
The overall yield from 4-thiothymidine to the final product was about 
25%. 

0 

C. Synthesis of protected deoxvnucleosides 

5 , -0-4,4 , -dimethoxytritylthymidine, N-benzoyl-5 T -0-4,4'-dimeth- 
oxytrityldeoxycytidine, N-benzoyl-5 T -0-4,4 ! -dimethoxytrityldeoxyadeno- 
sine, and N-isobutyryl-5 , -0-4,4 ! -dimethoxyltrityldeoxy-guanosine were 
synthesized by standard methods (Narang, et ah . in Methods in 
Enzvmology . Wu, Ed., Vol. 68, 90, 1979). 



WO 89/02921 



PCT/US88/03214 



- 14 - 

1- N-ben20vl-deoxv-6-thioguanosine 

Preliminary experiments indicated that the N-isobutyryl and 
N-acetyl derivatives were too labile during alkaline hydrolysis to allow 
a significant yield of the N-protected nucleoside. To 0.5 gm (1.7 
mmole) of deoxy-6-thioguanosine, that had been repeatedly dried by 
evaporation of anhydrous pyridine, was added 3.3 ml of anhydrous 
pyridine and 6.6 ml of redistilled chloroform. At 4°C, 4.7 ml of 
redistilled chloroform containing 1.35 ml (12 mmoles) of benzoyl chlo- 
ride was added dropwise with stirring in a flask with a CaCl 2 drying 
tube. After the addition of the benzoyl chloride all .of the 
deoxy-6-thioguanosine went into solution and the solution was yellow. 
The solution was allowed to come to room temperature and was stirred 
for three hours. Silica gel TLC of a sample of the reaction with 
methanol/chloroform (0.5:9.5 v/v) showed one spot with UV absorption 
that turned dark brown on exposure to acid and heat. The spot was at 
the solvent front. The reaction solution was poured into 60 ml of ice. 
After melting, 10 ml of chloroform was shaken with the aqueous emul- 
sion. After separation of the phases, the organic phase contained all 
the yellow color. The- organic phase was washed three times with 20 
ml of water. The organic solution was dried with sodium sulfate and 
evaporated to an oil. 

19 ml of pyridine was added to the oil. A clear solution was 
obtained. 1.8 ml of water was added and then 19 ml of methanol was 
added. The solution was placed at 4°C and 2N NaOH was added slowly 
until a pH reading of 12.4 - 12.5 was reached. The pH was maintained 
around 12.4 by the addition of 2N NaOH. The hydrolysis was followed 
with silica gel TLC using methanol/chloroform (1.5:8.5 v/v). A major- 
ity of the OV absorbing material was present in one spot with an Rf of 
0.3. The Rf of deoxy-6-thioguanosine was 0.19. Some care was neces- 
sary with the addition of the NaOH and the time in order to avoid a 
significant production of deoxy-6-thioguanosine. The reaction was 
stopped by lowering the pH to a reading of 7.8 with 20% acetic acid. A 
considerable loss was sustained if an exchange resin was used for the 
neutralization. The solution was evaporated to an oil. 
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Pilot experiments demonstrated that the difference in solubility 
of the N-benzoyl derivative and the di, tri, tetra benzoyl derivatives in 
hot water provided a convenient method of purification. 600 ml of 
water was added to the oil and heated to 70 °C with stirring. Liquid 
was decanted from the insoluble material and placed at 4°C. A precip- 
itate formed and was recovered by filtration. The aqueous solution was 
evaporated to 250 ml and a second precipitate recovered by filtration. 
Silica gel TLC of the combined precipitates showed more than 90% was 
the N-benzoyl derivative. 

An equivalent separation procedure was to dissolve the oil in 20 
ml of chloroform and to extract repeatedly the chloroform with a 
weakly basic solution of NH4OK, pH 10. 

The identification of the 0.3 Rf spot with N-benzoyl-de- 
oxy-6-thioguanosine was based on a quantitative determination of the 
amount of benzoic acid recovered after complete alkaline hydrolysis 
and a characteristic shift of the peak of absorption from approximately 
340 nanometers to 320 nanometers when the pH was changed from 5 to 
12. A shift does not occur if a thioester is present. 
2. N-benzovl-5'-0-4 t 4 , -dimethoxvtritvl'deoxv-6-thioguanosine . 

125 mg (0.32 mmole) of the N-benzoyl-deoxy-6-thioguanosine 
was dried by repeated evaporation of anhydrous pyridine. 1.25 ml of 
anhydrous pyridine was added and 172 mg (0.5 mmole) of 4,4 , -dimethox- 
ytrityl chloride* was added at room temperature. After two hours silica 
gel* TLC with chloroform showed no N-benzoyl-deoxy-6-thioguanosine 
was present. 3 ml of methanol was added. After 15 minutes the solu- 
tion was added to 6 ml of cold water. The aqueous solution was 
extracted with 5 ml of chloroform. The chloroform was washed with 5 
ml of water and the organic phase was dried with sodium sulfate. The 
solution was evaporated to 2.5 ml and the chloroform solution was 
streaked on 4 silica gel preparative plates, 1000 u, 20 x 20 cm with a 
fluorescent indicator. Methanol/chloroform (1:9 v/v) was the solvent. 
The dimethoxyltrityl derivative was located by running an analytical 
TLC and determining which UV absorbing spot turned orange on 
spraying with acid. The bands of silica containing N-ben- 
zoyl-5 , -0-4-4 , dimethoxytrityl-deoxy-6-thioguanosine were scraped off 
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and eluted with methanol. The silica particles were removed by 
centrifugation. The methanol solutions were pooled and evaporated to 
dryness. 

3. 5 l -Q-4-4-demethoxvtritvl-deoxvribosyl-5-methvl-2-pvrimidinone . 

0.639 gm (2.5 mmole) of beta-deoxyribosyl-5-methyl-2-pyrimidi- 
none was dried by repeated evaporation of anhydrous pyridine. 8 ml of 
anhydrous pyridine and 1.1 gm (3.2 mmole) of 4,4-dimethoxytrityl chlo- 
ride was added with stirring at room temperature. All the material was 
in solution by 30 minutes. After 45 minutes a sample was analyzed by 
silica gel TLC with methanol/chloroform (1:9 v/v) and indicated the 
reaction was complete. The derivative had an Rf of 0.37. Methanol 
(1.2 ml) was added and stirred for 15 minutes. The solution was poured 
into 20 ml of ice cold water. After standing overnight at 4°C, the 
aqueous phase was decanted from the light yellow gum. The gum was 
dissolved in 10 ml of ethyl acetate. The aqueous phase was extracted 
with 8 ml of ethyl acetate and combined with the initial ethyl acetate 
solution. The ethyl acetate solution was washed with 7 ml and each of 
NaHC03, water, 1M NaCL The ethyl acetate solution was dried with 
sodium sulfate, decanted, and evaporated to a gum. The gum was dis- 
solved in 4 ml of chloroform and added to a short silica column, 20 x 
75 mm. A solution of 0.5% triethylamine in chloroform was run 
through the column until yellow color started to come off. The solvent 
was changed to methanol/chloroform (0.2:10 v/v) and 10 ml fractions 
collected. The fractions were monitored at 329 nanometers for the de- 
oxyribosyl-5-methyl-2-pyrimidinone derivative. Samples- of the frac- 
tions were analyzed on silica gel TLC with methanol/chloroform 
(1:9 v/v). The fractions that appeared pure were pooled and evapo- 
rated under vacuum. 

4. Triethvlammonium(5 , -0-4.4 , -dimethoxvtritvl-protected-deox- 
ynucleoside-3 T -Q-(2-chlorophenvl phosphate)) 

All the compounds were prepared by standard methods (Narang, 
et ah . in Wu t R. Ed., Methods in Enzvmology . 68:90, 1979). It was 
essential to check all suspectible solvents for peroxides when carrying 
out the procedures with deoxy-6-thioguanosine. 
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D. Synthesis of oligodeoxynuceotides 

The phosphotriester method (Sproat, et aL t ibid ) was used with 
solid supports of polystyrene or controlled pore glass. The final 
synthesis of oligodeoxynucleotides containing 6-thioguanine and 5— me- 
thyl-2-pyrimidinone used a glass support. 

For the synthesis of each of the oligodeoxynucleotides 
containing 6-thioguanine and 5-methyl-2-pyrimidinone 16 mgs of long 
chain alkylamine controlled pore glass was used with 0.38 moles of 
5 , -0-4 l 4 f -dimethoxytrityl-2 , -deoxyguanosine-3 l -0-succinate attached. 
The reaction solutions were prepared by adding 80 ul of anhydrous 
pyridine to 10 umoles of dry, protected nucleotide. The pyridine 
solution was added to 13 mg of l-(mesitylene-2-sulfonyl)-3-ni- 
tro-l,2,4-triazole and after one minute 8 ul of 1-methyl-imidazole was 
added. The solution was added to the solid support under nitrogen at 
room temperature. The reaction was terminated after 30 minutes by 
washing the solid support with anhydrous pyridine and dichlorome- 
thane. The dimethyoxytrityl group was removed with 2% 
trichloroacetic acid in dichlorome thane. The support was washed with 
dichloromethane and then anhydrous pyridine. The next cycle of 
nucleotide addition was started. 

E. Cleavage, Deorotection, and Regeneration 

The alkaline suspectibility of 6-thioguanine and 5-methyl-2-py- 
rimidinone precluded the use of the standard aqueous methods of 
cleavage from the solid support and the deprotection of the oligodeoxy- 
nucleotides. Model experiments demonstrated that NH4OH solutions 
converted deoxy-6-thioguanosine to deoxyguansine and several minor 
components within several hours at a temperature of 50 °C. 
Deoxyribosyl-5-methyl-2-pyrimidinone was converted to another com- 
ponent within minutes on exposure to strong alkaline conditions at 
room temperature and this compound changed over longer times into 
several components. Model experiments demonstrated that the 
deoxy-3'-0-succinate bond and the o-chlorophenylphosphate bond were 
cleaved slowly with syn-2-nitro-benzaldoximate in anhydrous pyridine. 
The isobutyryl and benzoyl groups were removed by ammonolysis in 
anhydrous methanol. 
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Model experiments with deoxy-6-thiouguanosine demonstrated 
that l-(mesitylene-2-sulfonyl)— 3-nitro-l,2,4-triazole reacted .rapidly 
with the 6-thio group under the conditions of the reaction to add 
protected nucleotide to the oligodeoxynucleotide. Deoxy-6-thioguano- 
sine was completely regenerated from the mesitylene sulfonyl adduct 
by benzenethiol in pyridine. As expected, the reaction with 
mercaptoethanol was very slow and gave more than one product. 

The glass support with the oligodeoxynucleotide was dried in 
vacuum over P2O5 and KOH. 0.45 ml of anhydrous pyridine with 55 ul 
of benzenethiol (1 M) was added to the glass under dry nitrogen gas. 
The reaction vial was tightly closed and left at room temperature for 
eight hours. The solution was removed and the glass support was 
washed four times with 1 ml of dichloromethane. The remaining di- 
chloromethane was removed by vacuum and the glass support dried 
over P2O5 and KOH. 

16,6 mg (100 umole) of syn-2-nitrobenzaldoxime was dried by 
repeated evaporation of anhydrous pyridine, 200 ul of anhydrous 
pyridine was added in a nitrogen atmosphere to dry nitrobenzaldoxime 
and then 36 ul of dry tetramethylguanidine was added. The solution and 
glass support were sealed in a vial under nitrogen and kept at room 
temperature for 5 days. The extent of the release of the oligodeoxynu- 
cleotide into the solution was followed by assaying l ul of the solution 
for the dimethoxytrityl group. 27 mg of p-nitrophenylacetate was 
added under nitrogen to the nitrobenzaldoximate solution to use up the 
remaining oximate ions. After three hours 1 ml of pyridine containing 
10 ul of 20% acetic acid was added. The pH was measured by paper to 
make sure the pH was between 7 and 8. The solution was removed 
. from the glass support and 1 ml of 50% aqueous pyridine was added to 
/ the glass support and the suspension shaken for 30 minutes. The solu- 
tion was removed and combined with the initial pyridine solution. The 
solution was evaporated to dryness. 

1 ml of dry methanol containing 9mg (25 mmole) of cetyltrime- 
thylammonium bromide was added to the dried N-protected oligodeoxy- 
nucleotide. The solution was saturated with NH3 gas at 4°C. The test 
tube was stoppered tightly and securely, and placed at room 
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temperature in the dark. After 7 days the test tube was opened at 4°C 
and the solution evaporated at room temperature with dry nitrogen. 
The residue was dissolved in 1 ml of 80% acetic acid to release dimeth- 
oxytrityl group. After 20 minutes 1 ml of water was added and the 
aqueous solution was extracted five times with 2 ml of water saturated 
diethylether. The aqueous phase was evaporated to 200 ul. The solu- 
tion was yellow. The solution was placed on a Dowex AG-50Wx2 col- 
umn, 1.5x3 cm, that had been washed with 1M NH4HCO3 and then 
washed with distilled water. The column was eluted with distilled 
water and 0.5 ml fractions were collected. The fractions were 
monitored at 260 nanometers for the oligodeoxynucleotide and the 
appropriate fractions were pooled. The pooled fractions were evapo- 
rated to dryness and stored at -20 C. 

F. Extinction Coefficients 

The extinction coefficients at 260 nanometers of the oligodeoxy- 
nucleotides listed in Table I were calculated as follows: 1. An extinc- 
tion coefficient of 10 x 10 4 M -1 cm for A-T was calculated from the 
data listed by Soher ( CRC Handbook of Biochemistry . 2nd Ed., Sober, 
H.A, (Ed.), The Chemical Rubber Company, Cleveland, OH). 2. The 

c 

extinction coefficient of the G°-T oligodeoxynucleotide was taken to 
be 10 x lO 4 !^," 1 cm since the nucleoside of 6-thioguanine has an 
extinction coefficient at 260 nanometers of 8 x lO 3 !^" 1 cm essentially 
independent of pH. 3. The extinction coefficient of the oligodeoxynu- 
cleotide containing 5-methyl-2-pyrimidinone was taken to be 9.2 x 
10 4 M~* cm because the deoxynucleotide of 5-methyl-2-pyrimidinone 
has essentially no ahsorption at 260 nanometers (Laland, et ah . ibid ). 

G. Ligation Assay 

Labeled oligodeoxynucleotide concentrations varied between 0.1 
and 0.01 uM. The carrier oligodeoxynucleotide was T-C-G-A-C-C-C-G- 
G-G and its concentration was 1 to 2 uM. The other components were 
0.06M Tris-HCl, pH 7.5, 5 mM MgC^, 5 mM dithiothreitol, 0.5 mM 
ATP, and 0.06 to 0.005 Weiss units of T4 ligase per ul. The temperature 
was 19 °C. The buffer, MgCl2 f dithiothreitol, and oligodeoxynucleotides 
were combined and subjected to the following temperature cycle: 
60-65 °C for 5 minutes, room temperature for 15 minutes, and put on 
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ice for 15 minutes. The 0.5 ml polypropylene test tubes were centri- 
fuged to return all the water to the bottom before addition of ATP and 
the ligase. 

H. Gel Electrophoresis 

. Preparative electrophoresis used 25% acrylamide gels (Gough, et 
SS-t Nucleic Acid Research . 6:1557,1979). The gels were 0.2 x 13 x 
29 cm. Before sample addition electrophoresis was carried out with 
10 M glutathione in the upper buffer to remove peroxides and free 
radicals. 1Q~ 3 M glutathione was present in the sample solution. After 
the samples entered the gel, the current was reduced -to 4 ma and 
electrophoresis was terminated when the dye, cyanol xylene, had 
traveled about 9 cm, 18-24 hours. UV absorbing bands of oligodeoxynu- 
cleotides were excised, ground into small pieces, and extracted several 
times with 0*1M NH4HCO3 over several days. 

I. HPLC Analysis 

The nucleotides were analyzed on a weak anion exchange resin, 
Syn Chropak AX100, 250 x 4.6 mm, SynChrom Inc., Linden, IN., with a 
mobile phase of .05F KH 2 P0 4 , pH 4.5. The nucleosides were analyzed 
on an octadecyl-silica column, 250 x 4.6 mm, with a mobile phase of 
2.5% methanol and .02F KH 2 P0 4 , pH 5.5. Methanol was changed to 
10% for deoxyadenosine. Using this technique it was shown that the 
new nucleotide bases were present in the oligonucleotides, 

EXAMPLE 2 

PHYSICAL CHARACTERISTICS OF SYNTHETIC 
OLIGODEOXYNUCLEOTIDES 

Table 1 lists the oligodeoxynucleotides synthesized by the solid 
phase phosphotriester method. The oligodeoxynucleotides were 
isolated by preparative electrophoresis in 25% acrylamide gels. The 
electrophoretic purity of the oligodeoxynucleotides was checked by 
labeling the 5' end with P32 phosphate and performing an 
electrophoretic analysis in 20-25% acrylamide gels. The major P32 
labeled component accounted for greater than 90% of the radioactivity 
and no other component amounted to more than 2% of the total 
radioactivity. 
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TABLE I 

0 L I GODEOXYNUC L EOT I DE S SYNTHESIZED 



Designation 3 


Sequence 


#25 


T-C-G-A-C-G-G-A-T-C-C-G 


MPO 


T-C-G-A-OG-G-A-(MPO) -C-C-G 


6TG 


T-C-G-A-C-G-G-(6TG)-T-C-C-G . 


G 


T-C-G-A-C-G-G-G-T-C-C-G 


C 


T-C-G-A-C-G-G-A-C-C-C-G 



^he bases 6-thioguanine and 5-methyl-2-pyriniidinone are 
designated 6TG or G s and MPO or T H , respectively. 



The number of nucleotide residues in the oligodeoxynucleotide 
was verified by labeling the 5* end with P32 phosphate and sequentially 
degrading the polymer with snake venom phosphodiesterase. The num- 
ber of products obtained at various times of hydrolysis of the oligode- 
oxynucleotide was determined by electrophoresis in 20-25% acrylamide 
gels at 50 °C and autoradiography. 

The ultraviolet-visible absorption spectra of the oligodeoxynu- 
cleotides containing 6-thioguanine and/or 5-methyl-2-pyrimidinone 
showed significant absorption at neutral pH in the 310 to 350 
nanometer region over the absorption spectrum of the oligodeoxynu- 
cleotide with adenine and thymine at positions X and Y. 6-thioguanine 
has absorption peaks at 265 and 340 nanometers at neutral pH. 
Beta-deoxyribosyl-5-methyl-2-pyrimidinone has one absorption peak at 
314 nanometers. In addition, beta-deoxyribosyl-5-methyl-2-pyrimidi- 
none is fluorescent in ultraviolet light. The oligodeoxynucleotides 
containing 5-methyi-2-pyrimidinone were fluorescent under ultraviolet 
light. 

Relative Stability of Base Pairs 

The traditional method of determining the association constant 
of complementary oligodeoxynucleotides is to determine the optical 
density at 260 nanometers as a function of temperature. 
Unfortunately, this technique may lead to an erroneous interpretation. 
For oligodeoxynucleotides that have short runs of complementary 
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sequences, less than about 16, the optical density transition region is 
quite broad and even assuming that the two state model is appropriate, 
the location of the temperature of the midpoint of the transition 
requires very good data. With the limited amounts of oligodeoxynucleo- 
tide synthesized in this initial study and the presence of one third of 
the sequence of the oligodeoxynucleotides as single stranded regions in 
the duplex structures, the determination of association constants by 
the analysis of optical density vs. temperature curves was not feasible. 
In place of temperature melting curves an enzymatic ligation proce- 
dure was developed to determine relative association constants. 

The basic idea of the method was to measure the concentration 
of the duplex structure of a pair of oligodeoxynucleotides by determin- 
ing the amount of. ligation of the duplex to a carrier duplex, which was 
initially present in a much higher concentration than the concentration 
of the duplex of interest. The reason the ligation of the duplex to a 
carrier molecule was chosen instead of ligation of the duplex itself was 
the simplicity of the mathematical model. The mathematics of the self 
ligation is not simple except under very restrictive conditions. The 
amount of the duplex ligated to the carrier was followed by labeling 
one of the oligodeoxynucleotides of the duplex with P32 as a 5' phos- 
phate. The carrier was phosphoryiated with cold phosphate. Both the 
carrier duplex and the duplex of interest had 5' overhanging single 
stranded regions, T-C-G-A, that were complementary. Because of the 
possibility that single stranded oligodeoxynucleotides could be ligated to 
the carrier duplex, it was necessary to determine the extent of this 
reaction. * 

When the conditions listed in the next paragraph are realized 
the concentration of oligodeoxynucleotide 1 as a function of time is: 
L lnCC(l,t)/C(l,0>] =-[K s (l)+ K d (l,2)C a (2)]F(t) 
where C(1,0) is the concentration at time zero of oligodeoxynucleo- 
tide 1, C(l,t) is the concentration at time t, K<}(1,2) is the association 
constant between oligodeoxynucleotides 1 and 2, K s (l) is the term that 
characterizes the rate of ligation of single stranded oligodeoxynucleo- 
tide 1 to the carrier molecules compared to the duplex, C a (2) is an 
average value of the concentration of oligodeoxynucleotide 2 during 
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the time of reaction, is 1 when components 1 and 2 are different 
oligodeoxy nucleotides and 2 when the oligodeoxynucleotides are the 
same, and F(t) is a function of time which depends on the amounts of 
enzyme and carrier molecule but not on the amounts of oligodeoxynu- 
cleotides 1 and 2. 

The experimental conditions necessary for equation I to be 
applicable are: 1. There is a sufficiently high concentration of the car- 
rier so that the time course of the self ligation of the carrier molecules 
is not perturbed significantly by the amount of incorporation of the 
duplex. 2. The concentration of the duplex molecule is low so that the 
rate of ligation is proportional to the concentration of the duplex. 3. 
The concentration of the duplex is sufficiently low so that self ligation 
is insignificant compared to ligation with the carrier molecule. 4. The 
rates of association of the oligodeoxynucleotides and dissociation of the 
duplex are sufficiently fast so that equilibrium is maintained during the 
time of the reaction. 5. The Km's of the duplexes being compared are 
the same, that is, the significant characteristic is the 5' overlap region 
between the duplex and the carrier molecule and not the detailed 
sequence of the rest of the duplex molecule. 

The results of the ligation of the self complementary oligodeoxy- 
nucleotide #25, Table I, to the carrier molecule are shown in Table II. 
A plot of the data from Table n clearly showed that the single stranded 
ligation predominated in the concentration range used. Duplex 
formation becomes detectable at a concentration of the oligodeoxy nu- 
cleotide above 10- 7 M. In addition, the results establish the important 
characteristic that the amount of labeled oligodeoxynucleotide ligated 
to the carrier was proportional to the amount present, a condition that 
is necessary for the validity of equation I. 

Table in contains illustrative data that is used to determine the 
ratio between the association constant of the self complementary 
oligodeoxynucleotide, #25, and the association constant of the oligode- 
oxynucleotide pair 6TG and MPO, Table I. The data is analyzed using 
equation I. Line 4 has a concentration of #25 such that the ligation of 
the single strand predominated. Equation I becomes: 
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n. K s (25)F(t) = -ln[C(25,t)/C(25,0)] 

= - -ln[6592/1336l] 
.706 

Line 3 has a concentration of #25 such that both duplex and single 
stranded molecules were ligated. Substitution into equation I gives: 
HI. K s (25)F(t)+2K d (25,25)C a (25)F(t) = -ln[57240/126053] 

= .789 

K s (25)F(t) is known from equation n and using the equation: 
C a (2) = [C(2,0) - C(2,t)]/ln [C(2,0)/C(2,t)] 
C a (25)=I.2xlO~ 7 M. Combining these results: 

IV. K d (25,25)F(t)=3.48xl0 5 

When the same analysis is carried out with lines 1 and 2 

V. K d (6TG,MPO)F(t)=2.38xl0 5 
and 

VI. K d (25,25)=1.5K d (6TG,MPO) 

Many of the measurements were performed at two different 
concentrations of enzyme between .6 and .05 Weiss units per 10 ul and 
the concentrations of all of the oligodeoxynucleo tides were varied in 
different measurements. Relative association constants were not 
affected by different enzyme concentrations or by different oligode- 
oxynucleotide concentrations that allowed duplex formation to be 
detected. The independence of the relative association constants from 
changes in enzyme concentration and oligodeoxynucleotide concentra- 
tion strongly suggests that the equilibrium condition between duplex 
and oligodeoxynucleo tides was maintained during the course of the liga- 
tion. The Independence of the relative association constants from the 
concentration of the oligodeoxynucleotides and the results shown in 
Table n indicate that the substrate concentrations were low compared 
to the K m . 

In the cases where the amount of labeled oligodeoxynucleotide 
that is ligated to the carrier molecules is small compared to the 
amount of the unligated oligodeoxynucleotide it might be objected that 
the uncertainty in counting the unligated oligodeoxynucleotide, C(t) is 
as large or larger than the amount ligated. The amount that is ligated 
is added to C(t) to obtain C(O). However, it is the quantity 
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25 



m[c(t)/c(0)] 

that appears in the equations and when the difference C(0)-C(t) is 
small compared to C(O), 

ln[C(t)/C(0>] =-[C(0)-C(t)]/C(0) 
C(0)-C(t) is the amount of radioactivity actually measured. The uncer- 
tainty in C(t) appears only in C(O). 

It was not possible to measure the relative association constant 
of the self complementary oligodeoxynucleotide 6TG-MPO, Table I, 
because every polynucleotide kinase preparation that was used to 
phosphorylate the 5' end contained an activity that rapidly cleaved the 
oligodeoxynucleotide. 

The addition of the restriction endonuclease ? Sal I, specific for 
the sequence used in the ligation, and sodium chloride to 150 mM to 
portions of the completed ligation reaction, resulted in a loss of ligated 
labeled oligodeoxynucleotide and an increase in unligated labeled 
oligodeoxynucleotide. 

TABLE II 

LIGATION OF OLIGODEOXYNUCLEOTIDE #25 TO CAKKIER 

OLIGODEOXYNUCLEOTIDE 

Ligated 

concentration at 60 minutes at 60 minutes 

(urn) (cpm) (cpm) 



.05 9.2xl0 3 5.8xl0 3 

.025 4.5xl0 3 3.5xl0 3 

.0125 2.7xl0 3 1.4xl0 3 

.00625 1.5xl0 3 4.2xl0 3 

.003125 5.2xl0 2 3.2xl0 2 



The 5' phosphorylated (P32) #25 oligodeoxynucleotide, Table I, 
was ligated to the carrier molecule T-C-G-A-C-C-OG-G-G. The 
ligation reaction was composed of the indicated concentrations of the 
labeled oligodeoxynucleotide, .06M tris-HCl, pH 7.5, 5 mM 
dithiothreitol, .5 mM ATP, 5 mM MgC^, 1 uM of 5' phosphorylated 
carrier oligodeoxynucleotide and 0.6 Weiss units of T4 DNA ligase in 10 
ul. The temperature was 19°C. Samples were diluted 1 to 4 with 80% 
formamide and 2 ul samples were analyzed by electrophoresis on a 
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12.5% acrylamide, 7M urea gel at 50 °C. After autoradiography to 
locate the #25 oligodeoxynucleotide and the ladders of ligated oligode- 
oxynucleotides, the appropriate regions were excised and counted. 

TABLE III 

LIGATION OF OLIGODEOXYNUCLEOTIDES TO CARRIER 



OLIGODEOXYKUCLEQTIDE 

Line # composition Pre-ligation Ligation 

(cpm) (cpm) 

1 A-T H * (0.4xl0~ 7 M) 10577 1350 
G S -T*(0.7xlO~ 7 M) 

2 A-T H *(0.4xlO~ 7 M) 11632 875 

3 A-T*(0.85xl0~ 7 M) 57240 68813 

4 A-T*(0.13xlO~ 7 M) 6592 6769 



The indicated oligodeoxynucleotides were ligated to the carrier 
molecule T-C-G-A-C-C-C-G-G-G. The asterisk denotes the oligodeoxy- 
nucleotide that was r 5 -phosphorylated with P32. The sequences of the 
oligodeoxynucleotides are listed in Table I. The ligation reaction was 
composed of the indicated oligodeoxynucleotides and .06M tris-HCl, pH 
7*5, 5 mM dithiothreitol, 0.5 mM A TP, 5 mM MgCl 2 , luM of 5' 
phosphorylated carrier oligodeoxynucleotide, and 0.15 Weiss units of T4 
DNA ligase in 10 ul. The temperature was 19° C and the time was 60 
minutes. An aliquot of the reaction was diluted 1 to 4 with 80% 
formamide and 2ul samples were run on a 10% acrylamide, 7M urea gel 
at 50 °C The electrophoresis was terminated when the cyano xylene 
dye had migrated about 7 cms. After autoradiography to locate the 
oligodeoxynucleotides the appropriate regions of the gel were excised 
and counted. 
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TABLE IV 



RELATIVE ASSOCIATION CONSTANTS 



Oligomer pair 



Association 
constant 



Value relative to 
K(A,T/T,A) 



(A-C/T-G) 
(A-T H /T-G ) 
(A-T/T-G) 
(A-T/T-G) 
(A-C/T-G S ) 
(A-rVT-A) 
(A-C/T-A) 
(A-T/T-G ) 




1+.5 

l7(9+3) 



K(T /G) 

K(C/G 5 ) 

K(T H /A) 

K(C/A) 

K(T/G S ) 



1/(25+5) 



1/30 
1/40 
1/40 
1/40 



Legend: The first oligomer in each expression is written 5' to 
3', the second 3* to 5\ The * " indicates that the- amount of 
duplex ligation was within the experimental uncertainty of the 
amount found in the absence of the complementary 
oligodeoxynucleotide . 



A double stranded oligodeoxynucleotide containing the base pair 
5-methyl-2-pyrimidinone/6-thioguanine (MPO/6TG) is synthesized. The 
synthetic double stranded oligodeoxynucleotide has single stranded 5 ! 
ends corresponding to a unique restriction enzyme site. The synthetic 
double stranded oligodeoxynucleotide is inserted, in vitro , into the 
unique restriction site of a double stranded DNA vector, plasmid or 
phage, using the T-4 ligase catalyzed reaction. The insertion of the 
double stranded synthetic oligodeoxynucleotide into the DNA of the 
vector causes the inactivation of a protein function specified by the 
base sequence of the DNA of the vector in the region where the inser- 
tion takes place. The protein function lost, for example, the enzyme 
beta-galactosidase, is not essential for the replication of the vector in 
infected cells. A strain of E. coU which can be transformed by the 
addition of external DNA is exposed to the 'vector carrying the 



EXAMPLE 3 



IN VIVO REPLICATION OF THE BASE PAIR 
5-METHYL-2-PYRIMIDINONE/6-THIOG U A NINE 
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synthetic double stranded oligodeoxynucleotide. The strain of E. coU is 
chosen so that the presence of a replicating vector in a cell can be 
distinguished from a cell without the vector. In addition, the E. coli 
strain has the characteristic of allowing the presence or absence of the 
non-essential protein function to be detected. The transformed cells 
are grown in the presence of the bases and/or (deoxy)nucleotides of 
MPO/6TG. After many generations of replication of the cells and their 
vectors that do not have the non-essential protein function the vector 
is isolated. The oligodeoxynucleotide present in the vector DNA is then 
analyzed for the presence of the base pair MPO/6TG. 

In general, the M13 phage and plasmid cloning methodology of 
Messing ( M13 cloning/dideoxvsequencing Instruction Manual . Bethesda, 
Research Laboratories, Gaithersburg, MD, 1986) will be used. 
Complementary oligodeoxynucleo tides with 12 or more residues will be 
synthesized using the phosphotriester and/or phosphite-triester 
methods (Gait, M.J, (Ed.) f Oligodeoxynucleotide Synthesis . IRL Press, 
Oxford, 1984). The double stranded oligodeoxynucleotide will have the 
base pair 5-methyl-2-pyrimidinone/6-thioquanine (MPO/6TG). Single 
stranded regions at each end of the double stranded oligomer will have 
the Sal I restriction site overhang sequence (SOTCGA so that the double 
stranded oligomer can be ligated to the Sal I cloning site in the vector. 

Next, the replicative form of phage Mi3mpl8 and the plasmid 
pUC8 (or 18) will be cut with Sal I and the 5 1 phosphate of the linear- 
ized vectors will be removed using alkaline phosphatase. 

The standard litigation reaction mixture for a 10 ul volume will 
use 2fmoles of linear Ml3mpl8 or pUC8 (or 18) and 10-30 pmoles of the 
complementary oligodeoxynucleotides. The high concentration of com- 
plementary oligomers is necessary because the association constant of 
the short oligomers is low. The possible incorporation of concatomers 
in the vector is not a disadvantage. 

Transformation with both M13pml8 and pUC8 (or 18) after liga- 
tion will use the DH 5 alpha strain of E. coli (Bethesda Research Labo- 
ratories) or an equivalent. In the case of the M13mpl8 the DH 5 alpha 
cells will be plated with E. coU strain JM107 (Yarisch-Perron, et al „ 
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Gene . 33:103, 1985) since F' strains are required for M13 phage 
infection. 

In the case of M13mpl8, in addition to the standard ingredients 
of the plating medium, the medium will contain the bases and/or 
(deoxy)nucleosides of MPO and 6TG at concentrations of ,001 M to 
.0001 M. Colonies from the white plaques will be isolated and purified 
on a medium with MPO and 6TG. M13pml8 phage will be produced 
from cells growing in standard medium with MPO and 6GG added. Sin- 
gle strand DNA from the phage will be isolated by the phenol method, 
(Maniatis, et ah, Molecular Cloning, A Laboratory Manual . Cold Spring 
Harbor, 1982). 

When pUC8 or 18 is used, after exposure to the plasmid, cells 
from the transformation step will be plated on standard medium with 
additions of ainpicillin, 50 ug/ml, and the bases and/or (deoxy)nucleo- 
sides of MPO and 6TG at concentrations of .001 M to .0001 M. White 
colonies will be purified and the cells grown in the presence of P32 
phosphate to label the nucleic acid. Plasmids will be isolated from the 
cells using standard methods (Maniatis, et aL . ibid). 

In analyzing the phage DNA and the plasmid DNA the plasmid 
will be digested with Sal I restriction enzyme and the small oligodeoxy- 
nucleotides purified from the larger DNA of pUC8 by acrylamide gel 
electrophoresis with the location of the bands by radioautography. The 
small oligodeoxynucleotides will be eluted from the gel and digested 
with phosphodiesterase (Maniatis, et al .. ibid ). The digest will be ana- 
lyzed by HPLC using a strong anion exchanger (for example Whatman 
PXS-1025 SAX 4.6 x 250 mm) with a linear gradient from .007 F 
KH2P0 4 ,pH 4.0, to 0.5F KC1, .25 F KH 2 P0 4 ,pH4.5. Authentic 5» deoxy- 
nucleotides of MPO and 6TG will be run with the digest. The appear- 
ance of peaks of P32 that are identical to the peaks of the added 
nucleotides of MPO and 6TG will demonstrate that the base pair 
MPO/6TG has been replicated in in vivo . 

In addition, M13mpl8 single stranded DNA will be used as a 
template with a universal primer for the synthesis of oligodeoxynucleo- 
tides. Two reaction mixtures will be used. One will contain, in 
addition, the 5'triphosphates of the MPO and 6TG deoxynucleotides. If 
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the bases 6TG and/or MPO are present in the cloning site of the phage 
DNA synthesis will terminate or the rate will be substantially reduced 
when the polymerase encounters MPO or 6TG base in the template in 
the reaction mixture with only standard bases, but will continue 
through the cloning site in the reaction mixture with MPO and 6TG de- 
oxynucleotide 5' triphosphates. One of the standard nucleoside 5' tri- 
phosphates will have an alpha P32. An analyis of the oligodeoxynucleo- 
tides synthesized in the two reaction mixtures by acrylamide gel 
electrophoresis will show much larger oligodeoxynucleotides are 
produced in the mixture with MPO and 6TG 5» triphosphates than in the 
mixture without them. As a control, a comparison will be made with 
synthetic complementary oligodeoxynucleotides with quanine/cytosine 
or adenine/thymine base pairs in place of the MPO/6TG base pairs. 

The invention now being fully described, it will be apparent to 
one of ordinary skill in the art that many changes and modifications 
can be made without departing from the spirit and scope of the 
invention. 
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CLAIMS 

1. A double stranded genetic sequence having base pairs of 
adenine (A) and thymine (T), cytosine (C) and guanine (G), as well as 
base pairs of artificial purines paired with artificial pyrimidines, 
wherein said artificial purines have 2, 6 substituents that establish an 
interaction selected from hydrogen bonding and hydrophobic 
interaction with 2, 4 substituents of paired artificial pyrimidines, and 
wherein one of said interactions is H-S and further wherein there is 
significant free energy discrimination against base pairing between said 
artificial purines and said artificial pyrimidines and the standard base 
pairs when compared to either the standard base pairs or the artificial 
base pairs, such that the structural integrity of said double strand is 
maintained. 

2. A double stranded genetic sequence of A, T ( C, G that 
includes base pairs, wherein one base is an artificial purine having 2, 6 
substituents selected from H, O, S and NH2 and the complementary 
base is an artificial pyrimidine having 2, 4 substituents selected from 
H, 0, S and NH2, such that the 6' position of said artificial purine inter- 
acts with the 4 position of said artificial pyrimidine as a first base 
interaction and the 2 position of said artificial purine interacts with 
the 2 position of said artificial pyrimidine as a second base interaction, 
and wherein at least one of said first or second base interactions is H-S, 
and further such that the structural integrity of said double strand is 
maintained* 

3. The double stranded genetic sequence of claim 2, wherein 
said first base interaction is H-S and said second base interaction is 
selected from the group consisting of H-S, H-0 and NH2-0. 

4. The double stranded genetic sequence of claim 2, wherein 
said second base interaction is H-S and said first base interaction is 
selected from the group consisting of H-S, H-0 and NH2-0 

5. A method of controlling the rate of replication of an 
organism having a double stranded genetic sequence having base pairs 
of adenine (A) and thymine (T), cytosine (C) and guanine (G), as well as 
base pairs of artificial purines paired with artificial pyrimidines, 
wherein said artificial purines have 2, 6 substituents that establish an 
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interaction selected from hydrogen bonding and hydrophobic 
interaction with 2, 4 substituents of paired artificial pyrimidines, and 
wherein one of said interactions is H-S and further wherein there is 
significant free energy discrimination against base pairing between said 
artificial purines and said artificial pyrimidines and the standard base 
pairs when compared to either the standard base pairs or the artificial 
base pairs, by controlling the concentration of said artificial purines 
and said artificial pyrimidines available to said organism. 
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